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Abstract 

The problem of maximizing non-negative submodular functions has been 
studied extensively in the last few years. However, most papers consider 
submodular set functions. Recently, several advances have been made for 
submodular functions on the integer lattice. As a direct generalization of 
submodular set functions, a function / : {0,..., C}" —>■ R+ is submodular, 
if f{x) -I- f{y) > f{x Ay) + f{x V y) for all x, j/ € {0,..., C}'* where A and 
V denote element-wise minimum and maximum. The objective is hnding 
a vector x maximizing /(x). In this paper, we present a deterministic 
i-approximation using a framework inspired by [^. Moreover, we show 
that the analysis is tight and that other ideas used for approximating set 
functions cannot easily be extended. 

In contrast to set functions, submodularity on the integer lattice does 
not imply the so-called diminishing returns property. Assuming this prop¬ 
erty, it was shown that many results for set functions can also be obtained 
for the integer lattice. In this paper, we consider a further generalization. 
Instead of the integer lattice, we consider a distributive lattice as the 
function domain and assume the diminishing returns (DR) property. On 
the one hand, we show that some approximation algorithms match the 
set functions setting. In particular, we can obtain a 1 / 2 -approximation for 
unconstrained maximization, a (1 —i/e)-approximation for monotone func¬ 
tions under a cardinality constraint and a 1 / 2 -approximation for a poset 
matroid constraint. On the other hand, for a knapsack constraint, the 
problem becomes significantly harder: even for monotone DR-submodular 
functions, we show that there is no “^-approximation for ev¬ 

ery 5 > 0 under the assumption that 3 — SAT cannot be solved in time 

1 Introduction 

Recall that a set function / : 2^ ^ R is called submodular if f{U) + /(TV) > 
f{U n W) A f{U U W) for all U,W Q Af. Optimization problems with sub¬ 
modular objective functions have received a lot of attention in recent years. 
Submodular objectives are motivated by the principle of economy of scale, and 
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thus find many applications in real-world problems. Moreover, submodular 
functions play a major role in combinatorial optimization. Several combina¬ 
torial optimization problems have some underlying submodular structure, for 
example, cuts in graphs and hypergraphs, or rank functions of matroids. 

As a breakthrough result, the problem to find a subset S C J\f minimizing a 
submodular function / has been shown to be solvable in strongly polynomial 
time in |20| . In contrast, the corresponding maximization problem 

max{/(S') : S C J\f} (1) 

for a nonnegative submodular function / is easily seen to be iVP-hard, as it 
contains, for example, MAX CUT as a special case. We refer to (HD as UNCON¬ 
STRAINED SUBMODULAR MAXIMIZATION (USM). 

Recently, a generalization to submodular functions on a bounded subset of the 
integer lattice Z” has also been investigated. For x,?/ S Z” let (x V y)e denote 
maxjxe, He) and {xAy)e denote minjxe, ye} for e G {1,..., n}. Then, a function 
/ : D R on a finite set D of the form = {x S Z”|?i < Xi < Vz S 
{1,..., n}} is called submodular if 

f{x) + f{y) > f{x Vy) + /(x A y) Vx, y & D. 

Clearly, this captures submodular set functions since vectors with entries 0 and 
1 can be seen as incidence vectors of sets and in that case A and V correspond 
to intersection and union. D is called bounded integer lattice. Submodular 
functions on the integer lattice have been studied before, for example, in discrete 
convex analysis, L^-convex functions are of this type. We provide more details 
in “Related Work”. 

Submodularity for set functions is equivalent to the diminishing returns (DR) 
property: For S' C T, e ^ T we have /(S -|- e) — /(S) > f{T -\- e) — f{T), a 
very natural property in many problems. Submodularity on the integer lattice, 
however, is a weaker property in the sense that Diminishing Returns imply 
submodularity, but not the other way round. Soma and Yoshida |34) have 
investigated the setting of DR-submodular functions on the integer lattice. We 
will use the prefix “DR” to describe problems that consider DR-submodular 
functions. 

In addition to the integer lattice, we can also consider a more general structure: 
distributive lattices, that is, lattices where the meet- and join-operation fulfill 
distributivity. We will provide definitions in Section[2l For now, it is important, 
that due to Birkhoff’s theorem, we can represent any distributive lattice as the 
set of all ideals on some poset P and vice versa. Form now on, we always assume 
that distributive lattices T’(P) are (implicitly) given by this poset. 

Maximizing a submodular function subject to some side constraints is also a 
well-understood problem. Typically, these constraints are matroid or knapsack 
constraints, with the special case of a cardinality constraint, which can be seen 
as a matroid constraint for the so called uniform matroid. In contrast to the 
unconstrained setting, considering monotone functions still yields interesting 
problems here. 
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There are corresponding generalizations of matroids for both types of lattices 
presented above: On the integer lattice, we can consider integer poly matroids, 
on distributive lattices, there are poset-matroids. Hence, the problem of max¬ 
imizing a submodular function subject to these constraints can be generalized 
as well. 

1.1 Our results 

In this work, we present approximation algorithms for several constrained and 
unconstrained submodular maximization problems, in particular, we consider 
the following. 

USM on the integer lattice. Given a submodular function f : D ^ R+ on 
a bounded integer lattice D, we show how to find a pseudopolynomial i- 
approximation. Moreover, we show that the analysis is tight and discuss some 
difficulties for obtaining better approximation guarantees using existing tech¬ 
niques that have been applied successfully for set functions ((7=1). Note that 
the one-dimensional case where D = {0,1,...,(7} already illustrates that ob¬ 
taining an exact polynomial algorithm is not possible, since any function on D 
is submodular. A preliminary version of these results was published in m- 
DR- USM on the distributive lattice. Given a DR-submodular function / : 
V{P) ^ R+ on a distributive lattice D{P), we show how to generalize the 
algorithm of Buchbinder et al. for set functions to obtain a randomized i- 
approximation. 

Cardinality and Poset Matroid Constraints on the distributive lattiee. Given a 
monotone DR-submodular function / : ViP) —t K.+ on a distributive lattice 
X>(P), we obtain a (1 — Ye)-approximation for maximization under a cardinality 
constraint and a ^-approximation under a poset matroid constraint. 

Hardness on the distributive lattice. While the above results might suggest 
that submodular maximization on the distributive lattice is not fundamentally 
different from the integer lattice or even from set functions, this surprising re¬ 
sult shows otherwise. Maximizing a monotone, DR-submodular function under 
a knapsack constraint is well understood and approximable in the previously 
mentioned cases. However, we show, that for the distributive lattice, there is 
no 2(^°s("'^^^“^^^'*“^-approximation for every i5 > 0 under the assumption that 
3 — SAT cannot be solved in time . 

1.2 Related Work 

A comprehensive study on USM has been done by Feige, Mirrokni and Vondrak 
in m who provide and analyze several constant approximation algorithms for 
USM. In particular, they present a simple Local Search algorithm that yields a 
^-approximation. Using a noisy version of / as objective function, they could 
improve the performance guarantee to |. Feige et al. in m also showed that 
we cannot hope for a performance guarantee lower than i. They could prove 
that any ^ + e -approximation would require an exponential number of queries 
to the oracle. For symmetric submodular functions, they already show that the 
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ratio is tight. Subsequently, Oveis Gharan and Vondrak |30j and Feldmann, 
Naor and Schwartz m improved the approximation ratio to 0.41 and 0.42 
respectively and finally Buchbinder, Feldmann, Naor and Schwartz closed the 
gap and gave a randomzied i-approximation for USM in [^. They also present 
a deterministic ^-approximation using a similar idea. Both algorithms use a 
“Double Greedy” framework that starts with two different sets and, for a fixed 
order of the elements, decides in each step which of the two sets should be 
modified using the given element. Later, Buchbinder and Feldman showed in [5] 
how to derandomize this algorithm to obtain a deterministic ^-approximation. 
For several special types of submodular functions, better approximation ratios 
can be proved. For example, Goemans and Williamson HZ] provide an 0.878- 
approximation for MAX CUT and a 0.796-approximation for MAX DiCUT based on 
semidefinite programming and Ageev and Sviridenko give a 0.828-approximation 
for MAXIMUM FACILITY LOCATION [T|. 

Furthermore, USM has applications in marketing in social networks and revenue 
maximization |22| . Submodular function maximization also plays a crucial role 
in algorithmic game theory. For example, Dugmi et al. use USM as well as 
constrained submodular maximization for analyzing auctions in and Schulz 
and Uhan use USM as a subroutine to calculate the least core in cooperative 
games m- 

Since vectors with entries 0 and 1 can be seen as incidence vectors of sets, sub¬ 
modular functions on the integer lattice can be seen as a direct generalization 
of submodular set functions and in particular, all hardness results still apply. 
Moreover, the integer lattice can be interpreted as a specific distributive lattice 
that consists of disjoint chains. However, we remark that this correspondence 
is only pseudopolynomial, i.e. we need a pseudopolynomial number of elements 
to represent the integer lattice as a distributive lattice. Moreover, Soma and 
Yoshida observed that, for a DR-submodular function on the integer lattice, 
there is an equivalent submodular function on a Boolean lattice whose size is 
pseudopolynomially bounded (by n and the upper bound for the components). 
We refer to a talk given at a Hausdorff Trimester Program which is also publicly 
available |32| . In particular, any approximation algorithm for submodular set 
functions yields a pseudopolynomial approximation of the same guarantee for 
DR-submodular functions on the integer lattice, where the running time de¬ 
pends on the bound C. Thus, assuming DR-submodularity, the challenge on 
the integer lattice is developing polynomial algorithms. In particular, there is 
a polynomial ^-approximation for DR-USM on the integer lattice [35|. How¬ 
ever, our hardness result suggests that no such easy reduction to submodular 
set functions exists for distributive lattices. 

In discrete convex analysis, several special cases of submodular functions on 
the integer lattice have been investigated. For example, L^-convex and M^- 
concave functions are submodular and M'^-concave functions can be maximized 
in polynomial time m- 

As mentioned before, the problem of minimizing a submodular function is solv¬ 
able in polynomial time. This is not only true for submodular set functions, 
but even for submodular functions on distributive lattices, or more precisely on 
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ring families m- On the other hand, nothing is known for maximization on 
the distributive lattice. 

A considerable amount of work deals with maximizing both monotone and non¬ 
monotone submodular functions subject to some side constraints. Most of these 
papers consider submodular set functions. The side constraints that are typ¬ 
ically investigated are matroid and knapsack constraints and combinations of 
these. The first result in this area goes back to Fisher, Nemhauser and Wolsey, 
who considered monotone set functions and presented a (1 — i/ej-approximation 
for a cardinality constraint |29j and a i/p+i-approximation for P matroid con¬ 
straints m- Calinescu et al. improved this to (1 — i/e) for one matroid con¬ 
straint [7]. Nemhauser and Wolsey [2H] showed that any approximation better 
than (1 — i/e) requires an exponential number of queries if / is accessed via a 
value oracle even for the case of a cardinality constraint and monotone functions. 
Moreover, Feige showed that Max-fc-Cover, which is a special case of maximiz¬ 
ing a monotone submodular function subject to a cardinality constraint is hard 
to approximate to within a factor of (1 — ^ — e) for any fixed e unless P = NP. 

In contrast to the other complexity result, the submodular function is given by 
a closed form and not just accessed by an oracle. 

For monotone set functions, Sviridenko |36] provided the first (1—i/ej-approximation 
for a knapsack constraint. Since a cardinality constraint can also be interpreted 
as a knapsack constraint, the same hardness result applies. 

Inaba et al. [21] first considered constrained problems on the integer lattice 
and showed how to obtain a pseudopolynomial (1 — i/e)-approximation for the 
monotone knapsack case. In [34], Soma and Yoshida presented (1 — i/e — e)- 
approximations with polynomial running times depending on ^ for DR-submodular 
monotone functions and a cardinality, polymatroid or knapsack constraint. Fur¬ 
thermore, they achieve the same approximation guarantee for submodular mono¬ 
tone functions and a cardinality constraint using an algorithm whose running 
time depends logarithmically on the ratio of the maximum function value and 
the minimum positive increase. 

In |5], Chekuri, Vondrak and Zenklusen consider the very general problem of 
maximizing a non-monotone submodular function on a down-monotone family 
of sets, which in particular contains the previously mentioned constraints. They 
also provide a concise overview over previous results in non-monotone maximiza¬ 
tion and we refer to their paper for details. To the best of our knowledge, no 
results for non-monotone constrained maximization on the integer lattice exist. 
While our main interest in SMBIL is of a theoretical nature with the ultimate 
goal to gain a better understanding of submodular function maximization, there 
are applications where submodular functions on the integer lattice play a cru¬ 
cial role. For example, Bolandnazar et al. (|1]) showed that Assemble-to-Order 
Systems for supply chain management can be optimized by minimizing a sub¬ 
modular function on the integer lattice. [21] discusses several applications for 
submodular maximization on the integer lattice, e.g. budget allocation and sen¬ 
sor placement. There is also an alternative way to interpret the bounded integer 
lattice: Let x G Z" denote the incidence vector of a multiset where entries in 
X specify the multiplicity of individual elements. Then SMBIL can be seen as 
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maximizing a submodular function on multisets containing at most C copies of 
each element which gives rise to possible applications. This illustrates for exam¬ 
ple how sensor placement can be seen as an application of SMBIL by allowing 
the possibility to place multiple sensors at the same location. 

In Figure [1] we provide a graphic summary of all problems considered in this 
paper and related problems. An arrow from one box A to box B indicates 
that problem A generalizes B. We denote distributive, integer and Boolean 
lattices by DL, IL and Boolean respectively, where Boolean is equivalent to 
the the well-studied case of set functions. The diminishing returns property is 
denoted by DR and we remind the reader that every submodular set function is 
DR. This is not given on integer or distributive lattices, where DR-submodular 
functions are a subclass of submodular functions. Those cases where we only 
have pseudopolynomial algorithms are marked as such by the additional tag 
"pseudo". Due to results on hardness of approximation for USM and (< k, 
Boolean), all approximation guarantees except for (poset matroid, DL, DR) 
and (SMBIL) are tight (up to e). 

The paper is organized as follows: The necessary terminology is introduced in 
Section Then, we first investigate unconstrained maximization on the integer 
lattice in Section[3] In Section|3]we present algorithms for several maximization 
problems for DR-submodular functions on the distributive lattice and in Section 
0 we present a strong hardness result. 

2 Preliminaries 

For submodular set functions, it is iVP-hard to decide whether there exists a set 
S such that f{S) > 0, as mentioned by Feige et al. in [T3]. Since our main goal 
is finding good approximation algorithms, throughout this paper, we assume 
that all submodular functions are nonnegative and given by a value oracle. 

For a vector x G 7A we denote by {x\xj = k) the vector where all but the entry 
Xj remain the same and Xj is set to k. As usual, [n] denotes the set {0,1,..., n} 
and e® G {0,1}" denotes the vector where e) = 1 and e) = 0 for all j ^ i. 

2.1 Submodular maximization on integer lattices. 

Given a bounded integer lattice D — {x G lA\li < Xi < UiMi G {1,..., n}} and 
a submodular function f : D ^ R+, we consider the problem of maximizing / 
on D: 

max{/(x) : x G D}. (2) 

We will refer to (l2|) as Submodular maximization on a bounded integer 
LATTICE (SMBIL). For ease of notation, we will from now on assume that 
li = 0 and Ui = C \/i G {1,... ,n}. Thus, we prove all results for a bounded 
integer lattice of the form {0,1,..., G}", but all results in this paper can be 
easily generalized to any bounded integer lattice as defined above. 

As mentioned before, SMBIL generalizes USM, thus the hardness of approx¬ 
imation for USM holds. The integer lattice with C = 1 underlying USM is 
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(a) general submodular functions 



(b) monotone submodular functions 


Figure 1: Overview of all problems, those considered in this paper are shaded. 


also called the Boolean lattice. Directly generalizing the Diminishing Returns 
property for set functions, it can be stated for the integer lattice as in |34) : 
f{x + Xi) — f{x) > fiy + Xi)~f{y) for vectors x <y where Xi S 0,1" denotes the 
vector whose entries are 0 everywhere except for component i. They also remark 
that a function / is DR-submodular on the integer lattice if and only if / is sub¬ 
modular and coordinate-wise concave, i.e. f{x+Xi)—fix) > f{x+2xi)—f{x+Xi) 
for any x and i [33] . Our definition of submodularity differs slightly from that of 
Soma at ah: while we only define a submodular function on D, they define sub¬ 
modularity on Z", but restrict to vectors x with 0 < Xi < C for all optimization 
problems. These formulations are equivalent in the sense that any submodular 
function on D can be extended to a submodular function on Z”. 

2.2 Posets and Distributive Lattices 

For a partially ordered set (poset) P = {Af, r<), an antichain is a set S' C A/” of 
incomparable elements and a chain is a set S Q Af where each pair of elements 
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is comparable. An ideal is a set 5 C A/” where s € S implies t € S ior all t ^ s. 
By X>(P) we denote the set of all ideals for P. 

A lattice is a poset P any two of whose elements x,y G P have a meet x A 
i.e. a unique greatest common lower bound, and a join a; V y, i.e. a unique least 
common upper bound. A lattice is called distributive, if meet and join satisfy 
distributivity. One example for such a lattice is the (bounded) integer lattice 
with component-wise minimum and maximum as join and meet. An element x 
of a lattice is called join-irreducible, if it cannot be characterized as the join of 
two elements y, z ^ x . 

For a poset P, the pair {'D{P), C) forms a distributive lattice with union and 
intersection as join and meet. Conversely, any distributive lattice is isomorphic 
to the lattice of ideals w.r.t. the induced poset on the join-irreducible elements 
of that lattice. This fact is known as Birkhoff’s theorem [3]. Therefore, we will 
assume throughout this paper that a distributive lattice is given in the form 
(X>(P),C) for a poset P. For the example of the bounded integer lattice of 
dimension n, the corresponding lattice of ideals would be the set of ideals X>(P) 
on a poset P consisting of n disjoint chains of length C. For a more extensive 
introduction to lattice theory, the reader is referred to [3]. 

For a poset P = {M, ^), we can consider the following generalization of SMBIL. 

max{/(5) : 5 e V{P)}, (3) 

where PiP) is the collection of all ideals in the poset P and / a nonnegative 
submodular function on V{P). We refer to ([31) as Submodular MAXIMIZATION 
ON DISTRIBUTIVE LATTICES (SMDL). 

The DR-property naturally generalizes to distributive lattices: f{S-\-x) — f{S) > 
f{T-\-y) — f(T) for S CT with S,T G P{P) and x <y such that S-\-x,T-\-y G 
P(P)- This definition contains both the definition on set functions and on the 
integer lattice. 

2.3 Poset Matroids 

Integer polymatroids can be seen as a generalization of matroids to the integer 
lattice. These form special distributive lattices, as we saw above, and matroid 
structures can be extended further to distributive lattices as well. One way of 
doing so are poset matroids, and we are going to give a brief introduction here. 
The main difference to ordinary matroids is that the ground set is a poset and 
matroid axioms only have to hold for ideals of this poset. 

Definition 1. Given a partial order P = (W, ^), a nonempty family P C X>(P) 
forms the independent sets of a poset matroid if the following two properties 
hold: 

(Ml) Y €P,X CY,X G V{P) ^X gT 

(M2) X,Y gT, |A| < |r| ^ 3e e y \ a with X\J{e}G T. 
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Figure 2: Multigraph and poset on the edges, illustrating that independent 
ideals in the graphic matroid do not form a poset matroid. 


We remark that matroids are the special case of poset matroids where P consists 
only of pairwise incomparable elements and integer polymatroids correspond to 
the special case where P consists of disjoint chains. Therefore, by considering 
all subsets instead of the sets in T>{P) we obtain the usual matroid definition. 
We use the usual matroid terminology for poset matroids, for example, sets in T 
are called independent and a basis in a poset matroid is a maximal independent 
set. 

Poset matroids were introduced by Dunstan, Ingleton and Welsh m and are 
also known as distributive supermatroids. Moreover, they are a special case of 
ordered matroids as introduced by Faigle m and several other structures. They 
share many structural properties with matroids, in particular, Faigle showed 
that, under a consistency assumption on the cost, it is possible to compute a 
minimum cost basis using a greedy algorithm HU. Moreover, Barnabei et al. 
presented a strong base exchange property [5] and for two poset matroids on 
the same poset, finding a common independent set of maximum cardinality is 
possible in polynomial time as shown by Tardos EH- She also showed that the 
intersection problem is NP-hard for two poset matroids on posets with different 
partial orders. 

However, in contrast to polymatroids, as far as we know, there exists no poly¬ 
hedral description for poset matroids. 

It is important to realize that poset matroids are not just the intersection of 
independent sets of a matroid {E, P) and ideals of a poset on E. Consider 
for example the graphic matroid on the multigraph in Figure [5J Then the sets 
{ 61 , 63 } and { 62 } are independent ideals, but property (M2) is not fulfilled. 


3 Approximating SMBIL 

In this section, we consider unconstrained maximization of a general nonnegative 
submodular function on the integer lattice (SMBIL). In particular, we do not 
require DR-submodularity. 

We first present a 1 / 3 -approximation for SMBIL which is inspired by H] . Then, 
we show that our analysis is tight and discuss approaches for a randomized 
Double Greedy algorithm. “Double Greedy” refers to the idea of starting with 
two vectors a and b and modifying them until both vectors are equal, while 
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ensuring / never decreases. In the beginning at = 0 and bi = C \/i S {1,..., n}, 
i.e. initially, a (resp. b) is the unique minimal (maximal) element in D. Now, we 
traverse the components in a fixed order, say 1,... n. For a given index fc, we 
change ak and bk while maintaining a < b without decreasing the submodular 
function /. In particular, we hnd the best possible way to modify a and b while 
changing only component i. We then choose the modification which yields the 
biggest increase in / and set both vectors to that value. 

In the scenario where the integer lattice is bounded by I, our Algorithm [T] coin¬ 
cides with the one in [5] when vectors are interpreted as characteristic vectors 
of sets. 

Theorem 2. Let f : [C']” —>■ K.+ be a nonnegative submodular function. Then 
Algorithm\^ is a 3-approximation for SMBIL and has running time 0{CnT) 
where T is the time for one call to the oracle representing f. 

3.1 Proof of Theorem [2] 

The proof of Theorem [2] relies upon two lemmas. 


Algorithm 1: Generalized Double Greedy for SMBIL 
Input: A bounded integer lattice defined by a bound C and a dimension 
n, a nonnegative submodular function / on [C]” 

Output: A vector a € [C]" 

Set aj = 0, bj = C VI < j < n, i = 1 
for k = 1 to n do 

6a,i = maxcg[c] = c) - /(a*“fy 

Sb,i = maxce[c] fib"~'^\bl~^ = c) - 

if 6a, i > 6b,i then 

Let c' be the maximal number among those for which 6a,i is 
obtained. 

fy = (a*-i|ar^ =c'), V 

|_ =a\ = {b^\bl = c') 

else 

Let c' be the minimal number among those for which 6b,i is 
obtained. 

a* = fy-i, V = =c') 

L = b\ fy+i = (fy|4 = c') 
i = i-\-2 

return fy" 


First, we show that Algorithm [T] really is a Greedy algorithm in the sense that 
/ never decreases: 
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Lemma 3. Let f : [C]” —>■ K+ be a submodular function. Then for all 1 < i < 
2n and vectors a*, 6® as in Algorithmic the following holds: f{a^) > o.'n-d 

fm > /(&*-'). 




and 6® ^ will be 


Proof. Let k be the component in which the vectors 
changed in an iteration of the loop. 

By definition, Sa,i and Sb^i are nonnegative. Suppose that a is changed first, i.e. 
ba,i > bb^i, then clearly, the lemma holds for and a* and P = . Now we 

show that the change from V to 6*+^ does not lead to a decrease in /: 


We have = al) + /(a*-i|ar^ 

by submodularity of /. 


= C)> /(a 


2—1 1^^—1 


= al) + 


As /(a 


i-l I _i-l 


= al) > f{a 


i-l I _i-l 


= c) V ^ < c < C, the above implies 


-/(fe*-') >/(« 


= al) - f{c 


= C)>0. 


Since P ^ = P and 6®+^ = (&® ^\bl ^ = al), it follows that f{P~^^) > f{P) as 


desired. 

The case where 6b.i > Sa.i can be shown in the same way. □ 


Let OPT denote a fixed optimal solution for SMBIL. In order to bound the 
value of a solution of Algorithm [T] with respect to the optimum, we define 
OPT® = {OPT V a®) A 6® analogous to [6]. Consequently, OPT° = OPT and 
OPT^ri ^ ^2n ^ Lemma H we show f{OPT^-^) - f{OPT^) < /(a®) - 

/(a*"^) + f{P) - f{P~^), i.e. the decrease in OPT® (going from OPT^ to 
OPT^^) is bounded for each change of the vectors a or b. 

Then, as in [6], Lemma 0] can be used to prove Theorem [2] 

2n 2n 

f{OPT°)-fiOPT^^) = j:if{OPr-^)-f{OPT^)) < E (/(a')-/(a*-')) + 

2=1 2=1 

2n 

E (/(&*) - fib^~")) = /(«'”) + /(&'") - fia°) - f{b°) < /(a'") + /(6'"). 

2=1 

This is equivalent to f{OPT) < 3f{a^^) and the algorithm returns a^®®. Since 
the running time is obvious, this concludes the proof of Theorem 0] except for 
Lemma 01 


Lemma 4. For a submodular function f : [C]®® M+ the following holds in 
Algorithmic for all 1 < i < 2n where OPT^ := (OPT V a®) A P: 

/(OPP®-i) - /(OPT®) < fiP) - /(a®-i) + /(6®) - /(6®-i). 

Proof. Let us consider the case where a® ^ a®“^ and let k be the index, where 
the vectors differ. If al < OPTk then OPTjT^ = OPTl since bl = and 
thus OPT'^~^ = OPT^. Since Lemma 0] implies that the right-hand side of the 
equation is nonnegative, the inequality holds. 

So we assume now that al > OPT].. Since bl > al, we have OPTl = al, all 
other entries of OPT^~^ remain unchanged. Moreover, = 0, so OPTl~^ = 


11 


OPTk- Submodularity of / implies 

fiOPT^) + = OPTk) > /(OPr-i) + /(P->r' = O (4) 

^ /(OPT*) + /(P->r' = OPTk) - /(P-i) 

> /(opT*-i)+/(p->r' = <) - 

Suppose that f{OPT^~^) — f{OPT^) > /(a*) — /(a*“^), otherwise we are done. 
Using this assumption and (|3]) yields 

= OPTfc) - /(p-i) + /(OPr-i) - (/(a*) - /(a*-i)) (5) 

>/(p-i|5^-i = OPTfc) - /(P-i) + /(OPT*) 

>/(opT*-i)+/(6*->r' = 4) - /(&*■')■ 

But we have /(a*) - /(a*-i) > /(6*"^|6r^ = c) - /(&*”^) V < c < . 

This is true by design of the algorithm for the first change in an iteration of the 
loop. For the second change in an iteration, the other vector (in this case P~^) 
cannot improve further, so the claim holds as well. 

Setting c = OPTk and (H]) imply 

0 > /(P->r' = OPTk) - /(P-1) - (/(a*) - /(a*-i)) (6) 

>/(5*-i|6r'=4)-/(6*-i). 

But in the proof of Lemma|31 we have shown that = ®fe) “ ^ 

0, so ([6]) is a contradiction. 

The case 6* ^ can be treated analogously and if neither case applies, then 
clearly OPT* = OPT^-* as well. □ 

3.2 The guarantee of 4 is tight 

Since Algorithm [T] generalizes the deterministic algorithm in [^, the tight ex¬ 
ample they provide also works for our algorithm. But our example has a few 
additional properties: First, even if we do not prescribe the order of the compo¬ 
nents in advance and instead choose the index that results in the biggest increase 
in / for each step, the algorithm does not yield a better solution (see Theorem[5] 
below) which is not true for the example provided in [B]. However, it is not pos¬ 
sible to attain a ^-approximation by choosing a particular good order. This was 
shown by Huang and Borodin in [53] . They examine classes of Double Greedy 
algorithms and provide an upper bound of -|-e « 0.385-I-e on approximation 
ratios achievable by those algorithms. These include in particular the algorithm 
described above. Moreover, we will show in the next subsection that our ex¬ 
ample also has implications when trying to generalize the 1 / 2 -approximation 
presented in jH]. 

Theorem 5. For an arbitrarily small constant e > 0 there exists a submodular 
function for which Algor ithm\^ provides only a | + e-approximation. This re¬ 
sult still holds if the components are ordered such that at any time the chosen 
component yields the biggest increase in f. 
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Figure 3: Each node corresponds to a vector in {0,1,2}^, written next to it, 
with the value of / within the node. The picture can be read similar to Hasse 
diagrams of posets: There is a line connecting two vectors, if their Li-distance 
equals 1. If we think of the edges as being directed upwards, then the meet 
of two vectors is their biggest common predecessor, and their join the smallest 
common successor. 


Proof. Consider the following submodular function / : {0,1,2}^ —R+. Let 
f{a) = min{|{ (Xi . (Xi o}U{ tti : Oi < 2}|}. Now we define /(a) = 1 + e if a 
consists of either the entries 2,0,2 or 2,0,0 in any order. We set /(2,0,1) = 
/(O, 2,1) = 1 + e. For all other vectors, we set /(a) = f'{a). It can be checked 
that / is indeed submodular. To give a better intuition, Figure[3]illustrates this 
function. 

We analyze Algorithm [1] for this instance: Obviously, the optimum is the vector 
(1,1,1) of value 3, but Algorithm [T] terminates with a set of value 1 + e: 

In the first iteration, the maximal possible gain in / is 1 + e = i = 5t„i- So 
we we set a\ =2 and b\ = 2. Next, (5b3 = l + e>5a3 = 0 and thus = 0 and 
a| = 0. 

Therefore, we now have a* = (2, 0,0) and b'^ = (2, 2,0), both of value 1 + e and 
the algorithm returns the vector of' = b^ = (2, 2,0) of value 1 + e. 

Since the maximal possible gain in / is 1 + e for the first two indices independent 
of the order, the same order could have been chosen by an algorithm that always 
processes the index which maximizes the gain in /. Moreover, the tie-braking 
rule for 5 does not influence the value of the output here. □ 
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Figure 4: The values in the nodes give a submodular function for any a: > 1. 

While Algorithm [T] went “too far” when choosing the next entry, a variant of the 
algorithm that only changes the vectors by one at a time can be arbitrarily bad 
here. To see this, consider the example on [2]^ in FigurelH If our algorithm was 
only allowed to add 1 to a or delete 1 from b and chose the better of these two 
options, the output could not be (0, 2) which has an arbitrarily high value of x. 


3.3 Difficulties in Designing Algorithms 

We already mentioned that submodularity is a much weaker property on the 
integer lattice than on set functions. This also makes it hard to generalize 
some algorithms without assuming additional properties like diminishing re¬ 
turns. Consider for example the one-dimensional integer lattice, on which any 
function is submodular. Therefore, we cannot expect running times that are 
better than pseudopolynomial for any algorithm with bounded approximation 
ratio, unless the function fulfills other properties as well. This example also 
illustrates that some structural properties that are key to designing good algo¬ 
rithms on set functions do not hold any more. One such result is used in the 
local search approach presented in m- The authors rely on a generalization of 
a result about local optima, stating that the value of a local optimum x is at 
least as large as the value of all sub- and supersets. For the integer lattice, this 
statement is no longer true. Moreover, “continuous Greedy algorithms” that 
were used in various settings and also provided a better approximation ratio for 
USM, rely on component-wise concavity along non-negative directions for the 
so-called multilinear relaxation. Since such a relaxation coincides with the func¬ 
tion values on integer points, this property cannot be obtained without stronger 
assumptions like DR. 

As a generalization of the Double Greedy approach was possible, it seems rea¬ 
sonable to ask whether further results using similar techniques can also be gen¬ 
eralized to SMBIL. For USM, [5] also presents a randomized “Double Greedy” 
algorithm which gives a guarantee of They decide with probability propor¬ 
tional to the increase in / whether to add a given element to one set or delete 
it from the other. In the context of incidence vectors, this is equivalent to 
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choosing whether entries ai and bi are set to 0 or 1. We show that a similar 
analysis cannot work if we adapt their idea to our algorithm. We consider two 
natural strategies of generalizing the randomized algorithm above. One is the 
following: For given vectors a, b and an index i we can consider all possibilities 
to increase ai or decrease bi such that a < b remains true and choose one of 
these possibilities at random (again proportional to the increase in /). We will 
show this leads to arbitrarily bad solutions. 

The other alternative is more similar to our previous algorithm: We determine 
the best choice for a and b and then choose between the two options with prob¬ 
ability proportional to the <5-values, i.e. the maximal possible gain in /. While 
it is possible that this actually is a Y 2 -approximation, we can so far only show 
that this randomized algorithm gives the same guarantee as the deterministic 
version. Indeed, we will show that an analysis as in which bounds the ex¬ 
pected decrease of /(OPT*) in each step cannot prove a guarantee better than 
1/3 by examining the example presented in Figure [3] 

So, while it might still be true that this randomized version of our algorithm 
actually is a Y 2 -approximation (for example, the worst case expected value in 
the given example is 1.5 -I- e while the optimum is 3), we would require an 
analysis that takes a more global view. 

We should note that for the Boolean lattice, both these approaches are identi¬ 
cal and correspond to the randomized algorithm in [3]. We now analyze both 
algorithms. 

3.3.1 Randomized choice over all possible solutions 

Lemma 6. Consider the following randomized version of Algorithm 1: Assume 
an order of indices and consider index i. While ai < bi, randomly set either 
a = {a\ai = c) or b = {b\bi = c) where the choice is made over all ai < c < bi 
proportional to the increase in f if it is positive. If the increase in f is non¬ 
positive for all possibilities, we arbitrarily choose an option where the increase in 
f is 0, but a or b are changed. This algorithm for SMBIL can perform arbitrarily 
bad. 

Proof. First, we remark that in the case where the increase in / is non-positive 
everywhere, a choice as above exists. We now analyze the following instance: 
Given C G N, e > 0, we define a submodular function / : [C]^ ^ M as follows 


0, if a; = (0,0) or cc = (C, C) 



£ else 


Our randomized algorithm starts with a = (0,0) and b = (C, C). We start with 
index 1. Until oi = 61 , we change entry ai or bi and choose a value such that 
Oi < bi at random proportional to the increase in /. I.e., for the first step, we 
have 2C options where the increase in / is positive, all but one (setting oi = C) 
will be taken with probability ■ 

After the first step, either bi < C or ai > 0. In the second case, if ai < C, the 
only options now to increase / are changing bi or setting oi = C. Therefore, 


15 



after two steps either ai = C or bi < C and depending on which of these holds, 
the return value will be 1 or £. Thus, the probability for the algorithm to return 
a vector of value s is 


c-i 

Pr[b is changed first] + Pr[ai is set to i first, then b is changed] 

i=l 


Cs 

(2C- l)e+ 1 


£ 

{2C-l)e+l 


c-i 

E 


i=l 


(C — i)e 
{C — i)e + 1 — e 


We now show that this expression converges to one for C —>■ oo, thus showing 
that the expected return value of the algorithm converges to e. 

To achieve that, it suffices to find a lower bound converging to one. After 
rewriting the above expression we obtain: 


Ce 

{2C-l)e+l 


£ 

(2C'-1)£ + 1 


c-i 


E 


(C - i)e 
{C — i)e + I — e 


Ce 

-2Cs+l 

Ce 

~2Ce + l 


e [C — i)e 

(2C'-1)£ + 1 ^ {C -i)e + l 
c-i , 

_f_ y (i _i_ 

(2C'-1)£ + 1^V (C-i)e+l 


Ce 

[C - 1)£ 


2Ce + l 

(2C'-1)£ + 1 

(20-1)£ + 1 

Ce 

(O - 1)£ 

1 

2Ce+l 

(20-1)£ + 1 

(20-1)£ + 1 

Ce 

(O - 1)£ 

ln(0 - 1) + 1 

2Ce + l 

(20-1)£ + 1 

(20-1)£ + 1 


c-i 


E 


1 

(C — i)e 


c-i 

E 




1 

i 



0=1 for C —>-00 


For the last inequality, we used the fact that the partial sums of the harmonic 
series can be bounded by In, the rest is simple arithmetic. □ 

Note that variations like randomizing over combinations of values for Oi and 
bi and choosing proportional to the sum of the increases in / show a similar 
behavior. 

3.3.2 Randomized choice over the best possible solutions 

Lemma 7. Consider the following randomized version of Algorithm 1: Instead 
of deciding to ehange or depending on whether da,i > Sb^i, randomly 
change a or b proportional to max{0, 5a,i} and max{0, Then adapt the 

other vector as before. For this algorithm, there is no constant c < 1 such that 
EifiOPTy - /(OPT*)] < c • E[fiC) - /(a*-i) + /(P) - /(P-i)]. 
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Unlike in [B], where the above statement is true with c = 1 / 2 , an analysis that 
bounds the expected decrease of OPT® by the expected increase in a and b in 
each step cannot yield an approximation factor better than 1 / 3 . 

Proof. As before, OPT^ is defined as (OPTVa®) A&® for a fixed optimal solution 
OPT. Consider the example presented in the previous section in Figure [31 No 
matter how we choose the first index j, aj = 2 ot bj =0 both with equal 
probability. Thus, OPT^ consists of the entries 0,1,1 or 2,1,1 and thus has 
value 2 in both scenarios which implies 

P[/(OPT0) - /(OPPi)] = i . ((3 - 2) + (3 - 2)) = 1 and E[f{a^) - /(««) + 
/(&')-/(M] = 2-^(l+e). 

Note that the statement is true no matter how the order of indices is chosen. □ 

4 DR-submodularity on Distributive Lattices: Al¬ 
gorithms 

In the rest of this paper, we consider more general domains, namely distributive 
lattices, but only DR-submodular functions. In this section, we examine several 
problems, that are well-understood for set functions and DR-submodular func¬ 
tions on the integer lattice and propose algorithms. First, we investigate maxi¬ 
mizing monotone functions subject to a poset matroid constraint. Moreover, we 
consider the special case of uniform poset matroids, i.e. a cardinality constraint. 
Then, we generalize the double greedy approach to the unconstrained case. 

4.1 Monotone maximization subject to a poset matroid 
constraint 

Given a poset P = (A/”, A), let / : X>(P) —>■ K.+ be DR-submodular and mono¬ 
tone. For a poset matroid (P, P), we consider the problem 

max{/(5') : S € P}. 

We show a Y 2 -approximation guarantee for a general poset matroid and a guar¬ 
antee of (1 — i/e) for a cardinality constraint, which is the special case of a 
uniform poset matroid. These guarantees were shown by Fisher, Nemhauser 
and Wolsey for matroid constraints on the Boolean lattice (En],[Ii]) and the 
cardinality result is also presented in [18]. For cardinality constraints, this is 
tight in the sense that any algorithm evaluating / a polynomial number of times 
cannot yield a better approximation guarantee than (1 — i/e) |2^. There is also 
a (1 — i/e)-approximation for a matroid constraint (|7|) on the Boolean lattice 
and a (1 — i/e — e)-approximation for a polymatroid constraint on the integer 
lattice. However, these are based on continuous greedy algorithms and rounding 
a vector in a downward closed polytope, and it is not clear how to generalize 
these techniques for distributive lattices as we have no appropriate polyhedral 
description of poset matroids. 
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We start by describing and analyzing the algorithm for poset matroid con¬ 
straints. Then, we give a better analysis for a cardinality constraint. The 
algorithm is a simple Greedy procedure: Starting from the empty set, we con¬ 
sider a minimal element among those not yet processed in the partial order and 
choose the one that yields the biggest increase in /, compared to the current 
set. If including this element would result in an independent set w.r.t. the poset 
matroid, we add it, otherwise, we discard the element. This procedure yields 
a linear extension xi,.. .Xn of the partial order. A formal description can be 
found in Algorithm [5] 


Algorithm 2: Greedy with poset matroid constraint. 

Input: A poset matroid f : —>■ M-i- DR-submodular and 

monotone 

Output: A set S G T 

Set S' = 0 

for t = 1 to n do 

Select Xt = argmax,j, &M,S+x&V{P) + — f{^) 

if S + Xt G T then 
\_S = S + xt 

return S 


Theorem 8. Algorithm\^is a ^/ 2 -approximation for monotone DR-submodular 
lattice maximization subject to a matroid constraint and a {l—^/e)-approximation 
for a cardinality constraint. 

For the proof, we will use a known lemma, which we restate here: 

Lemma 9 i|15|l. Let nonnegative numbers Ci, pi for i G 1,..., K with pi-i > pi 
for all i and ^ t for t G 1,..., K be given. Then X]i=i Si=i Pi 

Proof of Theorem Let K be the size of the set S returned by our Greedy 
algorithm. For ease of notation, we introduce S° C S^ C ... C with S° = 0 
and = S. That means, S* marks the progression of S throughout the 
algorithm. Let OPT denote a fixed optimal solution. Now we go through the 
total ordering xi,.. .,Xn and define at-i as the number of elements in OPT 
that we considered before reaching Si, starting from the last element added to 

s*-i. 

More formally, let r® be the minimal index such that S® C {xi,.. .x^i}, for 
convenience we define xk+i = n-f 1. In other words, xi,... ,x\. are the elements 
considered by the Greedy algorithm at the point where S is set to S® and 
in particular {x^i} = S® \ S®“^. Then ai = |OPT n {x^i,. ■. ,Xj.i+i-i}\. We 
illustrate this definition in an example in Figure [S) 

We start by showing that a fulfills the conditions of the above Lemma, that 
is ELi < t for t G 1,...,K. By construction of our Greedy algorithm. 
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Figure 5: Partial order with induced total order given by indices. Consider 
OPT = {x 4 ,X 5 ,xe} and S = {xi,X 3 ,xe}, then ai — 0,(72 = 2,CT3 = 1 and 
= 3. 


5* is a maximal independent set in {xi,..., Therefore, by poset 

matroid properties, it also has maximum size and any independent subset of 
{xi,..., x^t+i-i} contains at most |S'‘| = t elements. As the numbers (Ti, ..., aK 
are just one way of counting the elements of OPT by grouping them into inter¬ 
vals on the given linear extension (w.l.o.g. = 1), we obtain 

t 

Y,^^ = \OPTf^{x4 5 ■ • ■ 5 1 X } I ^' 

i=l 

The last inequality holds as OPT is an independent set and thus OPT n 
{xi,..., x^t+i-i} is independent as well. 

Next, we show that the differences pi := /(5'*) —/(5'*“^) decrease as i increases. 
We distinguish two cases. If Xj.t and x^-i+i are not comparable, we could have 
added Xj.i+i to 5'*“^, and thus, xj. must have been the better option at that 
moment. Note that S'^~^ -\-Xri+i must be independent by poset matroid property 
(Ml), as 5'*“^ -I- Xri + Xr i +1 is independent. Therefore, 

P ,_1 = +Xr,)- + x,(+i) - 

> -I- X^i -I- X^i+l) - +Xri) = Pi, 

where the second inequality follows from DR-submodularity. 

Otherwise, Xj.t ^ Xj.t+i and pi-i > pi follows directly from DR-submodularity. 
Moreover, by monotonicity of /, Pi > 0 for all i. Therefore, we can apply Lemma 
[9] for Pi and ai. 

Now, we consider only the elements of OPT that are not chosen by our algo¬ 
rithm, ordered as in the linear extension given by xi,..., Xn- Let {ai,..., a^} = 
OPT \ S, ordered the same way as the Xi. We will partition this set into the 
subsets contributing to the different at and to simplify notation we define these 

subsets as OPT{S, t) := OPT \ S' fl { *X y’ £ ^ ^ 1 1} ' 

Now, we first use monotonicity of / and then rewrite. 
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I 

fiOPT) < fiOPT US)= f{S) + Y^ifiS U {ai,... aJ)-/(5 U {ai,... a,_i})) 

K 

= f{S) + Y Y (/('5'U{ai,...ai})-/(5'U{ai,...ai_i})) 

t=l ai^OPT(S,t) 

</(^)+E E {f{S^-^+Xr^)-f{S^-^)) 

ai^OPT{S,t) 

K 

= f{S) + Y - /(5*-')) 

f{S) + E(/(5‘”' + - /(^‘■') = 2/(5) - /(0) < 2/(5) 

t=i 

Inequality (*) is a consequence of DR-submodularity and the Greedy property: 
Consider some S OPT{S,t). If ^ at, the inequality we use follows 
directly from DR-submodularity as 5*“^ C 5. Otherwise, ai and x^t are in¬ 
comparable and there is a minimal element a G {x^*,..., x^,} with a ^ ai. By 
DR-submodularity, /(5 U {oi,..., Oi}) — /(5 U {ai,... ai_i}) < /(5*“^ -I- a) — 
/(5*“^). But adding Xr(t) maximized the increase in / and thus, this is at most 
-I- Xrt) — /(5‘“^) (as a is not considered earlier in the algorithm). 
Inequality (=i=*) is a direct consequence of Lemma [S] with a and p as defined 
above, and we already showed that the requirements for applying the lemma 
are fulfilled. 

This proves that our Algorithm is a 1 / 2 -approximation. 

If the poset matroid is the uniform matroid, the constraint is a cardinality 
constraint, i.e. |5| < k. For that case, we can use a similar argument to obtain 
a better bound: Consider some Si and define ai,... as before with Si instead 
of 5, i.e. {oi,..., at} = OPT \ Si. Clearly, i < k. 

Analogous to the previous inequalities, we can obtain 

£ 

fiOPT) < fiOPT U 5,) = f{Si) + YifiS U {ai,..., a,})-/(5 U {ai,.. .a,_i})) 

i=i 

i 

< fiS^) + E(/(^* + Xr}- fiS}) < fiS} + fc(/(5, + X,.) - fiS}) 
t=i 

As shown in |18| . this inequality directly implies that our Greedy algorithm is 
a (1 — i/e)-approximation. For the sake of completeness, we include the proof 
here as well. 

Define 6i := fiOPT) — fiS} and then rewrite the above inequality: 6i < 
kiSi — (5i+i). We rearrange to obtain Si+i < (l — ^)Si. It follows inductively 
that 6j < (1 — ^)^^o for all j. Moreover, non-negativity of / implies (ig = 
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f{OPT) — f(il}) < f(pPT). Now we use the fact that 1 — x < e “ for all x G K, 
resulting in 

5, < (1 - < e-ifiOPT). 

Resubstituting 6j then yields f{Sj) > (1 — e~^)f{OPT) which implies the 
approximation guarantee. □ 

The same technique as in the proof of theorem |8] can also be used to generalize 
this result to a collection of s poset matroid constraints to obtain an approxi¬ 
mation ratio of 

s+l 

4.2 Unconstrained Mciximization 

In this subsection, we again consider the randomized Double Greedy algorithm 
by Buchbinder et al. [6]. We will show that DR-submodularity seems to be a 
crucial property to obtain good approximation guarantees using that algorithm. 
In contrast to SMBIL, all proofs from [6] can easily be generalized to obtain a 1 / 2 - 
approximation for DR-SMDL. Note that this only results in a pseudopolynomial 
algorithm for the integer lattice. However, Soma and Yoshida showed how to 
adapt the Double Greedy to obtain a polynomial randomized Y 2 -approximation 
for DR-SMBIL [35]. 

Theorem 10. There is a randomized 2 -approximation for DR-SMDL. 

We start by describing the modified algorithm, for a formal description see 
Algorithmic To avoid complicating it, read ^ = 0 in the algorithm. In contrast 
to the algorithm for SMBIL we presented in Section |C we will only add or 
delete one element at a time (which corresponds to adding or subtracting I for 
SMBIL). Given a distributive lattice D{P) on a poset P = (A/”, ^), we fix a 
linear extension of P to obtain a total ordering xi,..., x„ of Af. Starting from 
A = 0 and B = N, we always consider a pair (xmin,Xmax) consisting of the 
smallest available element in the ordering and a maximal element Xmax such 
that the pair is comparable in P. Note that Xmin = Xmax is permitted. Then, 
we randomly choose to add Xmin to A or delete Xmax from B with probabilities 
proportional to the increase in / and remove the chosen element. This continues 
until A = B. DR-submodularity is essential here. Without it, the example 
presented in Figure U] with the linear extension c, d, a, b shows that Algorithmic 
would perform arbitrarily badly even for the integer lattice. 

Adapting the arguments of [5] to our setting is straightforward. We start by 
showing that for at least one of the two options, / does not decrease. 

Lemma 11. In every iteration of Algorithm\^ Oi -\-bi > 0. 

Since A G B — Xj and Xk < Xj, the above expression is just a reformulation 
of DR-submodularity and thus the Lemma follows. We fix an optimal solution 
OPT and let Ai and Bi be random variables denoting the sets A and B of 
elements after the i-th iteration. We show that for DR-submodular functions. 
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Algorithm 3: Greedy with Matroid Constraint 
Input: A nonnegative DR-submodular function / on a distributive 
lattice T){P) 

Output: A set A G I’(P) 

A = %, B = J\f, xi,... ,Xn a linear extension of P, i = 1 
k = 1, j = max{j' : Xk ^ Xj'} 
while A ^ B do 

Oi = f{A + Xk) - /(A) 
bi = f{B-Xj) - f{B) 

With probability inax{aT.oy+m2{fe..o} set A = A + /c = fc + f and 
find new j = max{j' : Xk A Xj’}. 

With complementary probability set B = B — Xj, find new j for k. 

return A 


we can get a good bound on the expected decrease in f{OPTi), where the 
random variable OPT^ is defined as {OPT U A^) n 

Lemma 12. For every iteration i, we have E[f{OPTi-i) — f{OPTi)] < ^ • 
P[/(A,) - /(A,_i) + /(P,) - /(P._i)]. 

Then, as presented in Section |31 by summing over all iterations and collapsing 
the resulting telescopic sum, we obtain the proof of Theorem [TUI It remains to 
show the lemma. 

Proof. The proof works completely analogous to |^. We will give the details here 
and omit them for similar results. It suffices to prove the inequality conditioned 
on Ai_i = Si-i for a set Si consisting of elements Xmin and a corresponding 
series of pairs {xmin,Xmax) when the probability that this series of pairs leads 
to Ai = Si is non-zero. We now fix such an event. Then, Pi_i, OPTi-i, ai,bi 
and Xmin, Xmax become constants. By LemmajTTJ Oi or bi must be nonnegative. 
Therefore, it suffices to consider the following cases. 

Suppose bi < 0 and thus Oi > 0. Then, Ai = Ai_i -|- Xmin and OPTi = 
{OPT U {Ai—i Xmin')') bl Bi—\ and Xmin,Xmax ^ Pz- If Xmin ^ OPT^ WC have 

0 = f{OPT,_i) - f{OPT,) < f. If Xm^n 0 OPT, then 

/(OPTz-i) - f{OPT) = /(OPP-l) - f{OPP-i + Xm^n) 

< /(Pz-1 - Xmax) - f{Bi-l) = 6z < 0 < ^ 

by DR-submodularity. The case where < 0 is analogous. 


22 







Therefore, suppose Oi > 0 and bi > 0. Then, 


E[f{A,) - /(A,_i) + fiB,) - /(B.-i)] 

(/(^j-l+2^mm)~/(^i-l)) H- —T if ~ ^max) — f (Bi-i)) 


Cii + bi 

al + bl 


Ui + bi 


Qi + bi 

On the other hand. 


( 7 ) 


£;[/(OPT,_i) - /(OPT,)] 

:^V(/(OPT,_i)-/(opr,_i + x™„)) 
m + 

+ —^(/(OPr,_i-/(OPT,_i - a;™,,)) 
a* + bi 


Qi + 6i 


We still need to explain the final inequality. If Xmax ^ OPT, the second term 
on the left hand side equals zero. For the first term , we use DR-submodularity 
to obtain f{OPTi-i)-f{OPTi-i + Xmin) < /(Pi-l - Xmax) - fiPi-i) = bi. 
Otherwise, Xmin G OPT as OPT is an ideal and thus, the first term is zero. 
Moreover, DR-submodularity implies f {OPTi-i)—f {OPTi-i—Xmax) < /(^i-i + 

^min) /(^2—l) ~ ^2- 

Now the inequalities ([S]) and ([7]) together with the fact that \ 

yield the desired result. □ 


The deterministic version from which yields a ^-approximation can also be 
generalized to this case. In contrast to the non-DR-setting in Section |31 the 
proofs from by Buchbinder et al. can be directly generalized together with 
the ideas used in this section. Therefore, we will not present them here. 

As shown in [5], there is also a deterministic Y 2 -approximation based on the 
randomized algorithm. The rough idea is the following. They maintain a set 
of possible “states” of sets with corresponding probability. For example, in 
the beginning, there is exactly one possible state: A = % and B = Af. In 
iteration i, they consider element Ui and for each current state {A,B), they 
assign probabilities to adding Ui to A or deleting it from B. This way, they 
build the set of states for iteration i. For example, after one iteration, the 
set of states would then be (iti,A/") with probability p and (0,A/" \ ui) with 
probability 1 — p. The groundbreaking idea in this paper is how to assign these 
probabilities. There are two aspects to this. First, the set of states should 
stay small, in particular, doubling its size in every iteration is not an option. 
Second, the expected decrease in the objects OPT {A, B) corresponding to states 
(A, B) should be small. Both these properties are achieved by determining the 
probabilities using a linear program. The first property follows from the fact 
that an extreme point of their linear program must have small support. The 
second property is enforced by inequalities in the LP. 
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Their strategy can also be generalized to the DR-submodular setting on distribu¬ 
tive lattices. But, there is one additional technical issue. In (5, the element 
that is to be added or deleted is the same for all states in an iteration i. For our 
setting, each state {A,B) also comes with its own pair Xmin A Xmax- However, 
the objects OPT {A, B) used for the analysis also depend on the state and for a 
fixed state we still obtain the necessary properties like OPT {A, B) C B\ Xmax- 
Therefore, we can use DR-submodularity in the same way as [S] to obtain a 
deterministic 1 / 2 -approximation for SMDL. Since the technical details are very 
similar to the ones already provided we will not present them. 

It seems natural to ask whether the approach presented in Section[3]for SMBIL, 
i.e. for non-DR functions, could also be generalized to distributive lattices. This 
would mean choosing a best join-irreducible element of the lattice to add to A 
and a best complement of a meet-irreducible element to delete from B. Which 
elements would be eligible to be chosen would be subject to discussion. But, 
using such a strategy, it is possible to end up in situations where there is no 
possibility of changing H or R as described above without decreasing their values 
w.r.t. /. 

5 DR-submodularity on Distributive Lattices: Hard¬ 
ness 

As the previous algorithms maximizing DR-submodular functions on the dis¬ 
tributive lattice were direct generalizations of algorithms for the Boolean lattice, 
one could assume that there is no significant structural difference. However, in 
this section we consider the problem of maximizing a monotone DR-submodular 
function / : T’(P) —>■ K+ on the distributive lattice subject to a knapsack con¬ 
straint with binary knapsack weights. Formally, 

max{/(S') : w{S) < R} for u; : A/" —>• {0,1} (9) 

This problem is essentially as hard to approximate as the Densest fc-subhypergraph 
problem and thus, in particular there is no constant factor approximation un¬ 
der some complexity assumptions. In contrast, for monotone functions on the 
Boolean lattice, there is a (1 — i/ej-approximation for one (arbitrary) knap¬ 
sack constraint [3S] and a (1 — i/e — eVapproximationf |26| 1 for 0{1) Knapsack 
constraints. Moreover, [34] generalized these results to the integer lattice and 
obtained a guarantee of (1 — I/e —e) for monotone DR-submodular functions and 
positive knapsack coefficients and a pseudopolynomial (1 — Ye)-approximation 
for monotone submodular functions [23]. Note that due to monotonicity of /, 
the result directly generalizes to nonnegative knapsack values. 

We start by introducing the densest fc-subhypergraph problem (DfcSH). Given 
a hypergraph H = (V, E) and fc € N, the goal is to choose k vertices such 
that the number of induced hyperedges is maximized. One special case is the 
densest ^-subgraph problem for which, although it is believed to be hard, it is 
only known that no PTAS exists if NP has no subexponential algorithms [25] . 
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However, Hajiaghayi at al. showed in m that DA:SH is hard to approximate 
within a factor of for some (5 under the assumption that 3 — SAT ^ 

DTIMEiT^^^ ^ ), i.e. 3 — SAT cannot be solved in time 2"^^ ^ . 

Theorem 13. If there is an a{n)-approximation for maximizing a monotone 
DR-submodular function on a distributive lattice subject to a 0-1-Knapsack con¬ 
straint, then there is a 2a{n'{m' + 1))- approximation for DkSH, where n and 
(n', m') denote the respective input sizes and a is positive and non-decreasing. 

Proof. Let H = (V,E),kGNhe the input of DfcSH. Our goal is the construction 
of a submodular maximization instance. We construct a poset P = (A/”, by 
introducing an element v for every vertex v € V and k elements 
for every hyperedge e € E. Slightly abusing notation, we are also going to 
denote by V the subset of elements in Af corresponding to vertices in H. Then, 

V < e^ \/i < k,v € V ii V € e in H. Moreover, f{S) = [S'! which clearly is 
DR-submodular and monotone and n = n' -\- km' < n'{1 + m'). The knapsack 
constraint is given by asking that [S' D Rj < fc, i.e. B = k and w{v) = 1 for 

V € V and w{e) = 0 otherwise. 

Any ideal I G D(P) corresponds to the set of vertices IllV and some hyperedges 
induced by this set. On the other hand, any induced subhypergraph in H can 
be represented (not uniquely) by an ideal. In particular, there is a one-to-one 
correspondence between induced subhypergraphs and ideals / where / \ V is 
maximal. 

Let S be the result of an Qf(n)-approximation for our problem. Since the knap¬ 
sack constraint only affects the elements in V, we assume w.l.o.g. that S contains 
either all copies e* for an edge e or none of them and that liSriHI = k. Therefore, 
S' n H corresponds to a feasible solution for DfcSH inducing =: P edges. 

We can assume that /3 > 1. If not, we construct a solution fulfilling that easily. 
Let TZ denote the optimal value for DfcSH. Then, the optimal value for our 
submodular maximization instance is fc(l -I- TV). Since S was obtained by an 
a(n)-approximation, /(S) = |S| > + TiP- Combining the above, we 

obtain 

fc(l + 11) < a{n)\S\ = a{n){P -\- l)fc < a(ri)2/3fc. 

Therefore, 2a{n'{m' -\- l))/3 > 2a{n)P > TZ and thus, we have a 2a{n'{m' + 1)) 
approximation for DfcSH. □ 

We could refine the constant of 2 in the previous analysis: For any constant c 
we can find a solution of DfcSH of value at least c by enumeration. (Unless the 
optimum is less than c, then we solve DfcS optimally.) If our algorithm performs 
worse, we take the solution of size c instead and therefore can assume /3 > c. 
This leads to a (1 -I- ^)a{n'{m' + l))-approximation in the above analysis. 
Using the above theorem we obtain: 

Corrolary 14. The submodular maximization problem defined in is hard to 
approximate within a factor 0 / 2 ^^°®*^" ^ for every constant <5 > 0 under 

the assumption that 3 — SAT ^ DTIME{2'''^^ * ) for every constant e > 0. 
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Proof. We need to examine the hardness results for DfcHS from m a little 
closer. The authors reduce from a problem presented in [12], the so-called 
Maximum Balanced Complete Bipartite Subgraph Problem. However, they 
omit the fact that the input graph for that problem is not just bipartite but 
balanced as well. This ensures that the complexity result for DfcSH is true even 
if the number of hyperedges is equal to the number of nodes. Moreover, m 
state their result for all constants <5, not just for some constant as cited and 
used in m, which is why we also use this more general formulation here. 
Therefore, we can now apply Theorem [T3] for a hypergraph with n' = m' and 
use the same reduction as above. Therefore, n < + n' < (n' -I- 1)^ and thus 

n' > y^n) — 1. Therefore, by setting a{n) = we obtain our 

result. □ 
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